Exploring Highly Structure Similar Protein Sequence Motifs using Granular Computing Model based on Adaptive FCM
نویسندگان
چکیده
Protein sequence motifs are very important to the analysis of biologically significant conserved regions to determine the conformation, function and activities of the proteins. These sequence motifs are identified from protein sequence segments generated from large number of protein sequences. All generated sequence segments may not yield potential motif patterns. In this paper, short recurring segments of proteins are explored by utilizing a granular computing strategy. Initially, Fuzzy C-Means (FCM) and Adaptive Fuzzy C-Means clustering algorithms (AFCM) are used to separate the whole dataset into several smaller informational granules and then succeeded by KMeans and Rough K-Means clustering algorithms on each granule to obtain the final results. By comparing the results of two different granular techniques shows that Adaptive FCM granular with Rough K-Means clustering is capable to capture better motif patterns suggests that our granular computing model which combined AFCM granular with Rough K-Means have a high chance to be applied in some other bioinformatics research fields.
منابع مشابه
Exploring Highly Structure Similar Protein Sequence Motifs using SVD with Soft Granular Computing Models
Vital areas in Bioinformatics research is one of the Protein sequence analysis. Protein sequence motifs are determining the structure, function, and activities of the particular protein. The main objective of this paper is to obtain protein sequence motifs which are universally conserved across protein family boundaries. In this research, the input dataset is extremely large. Hence, an efficien...
متن کاملProtein Sequence Motif Detection using Novel Rough Granular Computing Model
Protein sequence motifs information is essential for the analysis of biologically significant regions. Discovering sequence motifs is a key task to realize the connection of sequences with their structures. Protein sequence motifs have the potential to determine the function and activities of the proteins. Many algorithms or techniques are used to determine motifs which require a predefined fix...
متن کاملSoft Granular Computing Model for Identifying Protein Sequence Motif Based on Svd-entropy Method
Bioinformatics is a field devoted to the interpretation and analysis of biological data using computational techniques. In recent years the study of bioinformatics has grown tremendously due to huge amount of biological information generated by scientific community. Proteins are made up of chain of amino acids. Protein sequence motifs are small fragments of conserved amino acids often associate...
متن کاملFgk Model: an Efficient Granular Computing Model for Protein Sequence Motifs Information Discovery
Discovering protein sequence motif information is one of the most crucial tasks in bioinformatics research. In this paper, we try to obtain protein recurring patterns which are universally conserved across protein family boundaries. In order to achieve the goal, our dataset is extremely large. Therefore, an efficient technique is required. In this article, short recurring segments of proteins a...
متن کاملNovel efficient granular computing models for protein sequence motifs and structure information discovery
Protein sequence motifs have the potential to determine the conformation, function and activities of the proteins. In order to obtain protein sequence motifs which are universally conserved across protein family boundaries, unlike most popular motif discovering algorithms, our input dataset is extremely large. As a result, an efficient technique is demanded. We create two granular computing mod...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014